A Multi-stage Method for Text-To-Pronunciation Conversion

Authors

  • Ching-Hsien Lee
  • Ren-Jr Wang
  • Chung-Jen Chiu
Abstract

Text-to-pronunciation conversion is often used in speech synthesis and speech recognition systems. In this paper we present a data-driven, language-independent, multi-stage model for text-to-pronunciation conversion. By training on a dictionary of well-aligned grapheme/phoneme pairs and applying a re-scoring strategy to graphemes that are likely to be tagged erroneously, our model not only increases efficiency but also achieves higher accuracy than other data-driven approaches that have been applied to the same tasks.
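The abstract does not give the model's details, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a toy aligned dictionary (`ALIGNED`, invented for illustration), a first stage that tags each grapheme with its most frequent phoneme, and a second stage that re-scores low-confidence graphemes using the next grapheme as context. The names `train`, `convert`, and `threshold` are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: each word is a list of aligned (grapheme, phoneme) pairs.
ALIGNED = [
    [("c", "k"), ("a", "ae"), ("t", "t")],
    [("c", "s"), ("i", "ih"), ("t", "t"), ("y", "iy")],
    [("c", "k"), ("o", "aa"), ("t", "t")],
    [("c", "s"), ("e", "eh"), ("l", "l"), ("l", "_")],
]


def train(aligned):
    """Count phonemes per grapheme, with and without right-hand context."""
    unigram = defaultdict(Counter)  # grapheme -> phoneme counts
    context = defaultdict(Counter)  # (grapheme, next grapheme) -> phoneme counts
    for word in aligned:
        for i, (g, p) in enumerate(word):
            nxt = word[i + 1][0] if i + 1 < len(word) else "#"  # "#" marks word end
            unigram[g][p] += 1
            context[(g, nxt)][p] += 1
    return unigram, context


def convert(word, unigram, context, threshold=0.8):
    """Stage 1: most frequent phoneme. Stage 2: re-score uncertain graphemes."""
    phonemes = []
    for i, g in enumerate(word):
        counts = unigram.get(g)
        if not counts:
            phonemes.append("?")  # grapheme never seen in training
            continue
        best, n = counts.most_common(1)[0]
        confidence = n / sum(counts.values())
        if confidence < threshold:
            # Assumed re-scoring rule: prefer the context-dependent statistics.
            nxt = word[i + 1] if i + 1 < len(word) else "#"
            ctx = context.get((g, nxt))
            if ctx:
                best = ctx.most_common(1)[0][0]
        phonemes.append(best)
    return phonemes


unigram, context = train(ALIGNED)
print(convert("cat", unigram, context))
print(convert("cit", unigram, context))
```

In this sketch only the ambiguous grapheme "c" falls below the confidence threshold, so the cheaper first stage handles every other grapheme and the context lookup runs only where a tagging error is likely.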

Similar resources

Multi-Stage DC-AC Converter Based on new DC-DC converter for energy conversion

This paper proposes a multi-stage power generation system suitable for renewable energy sources, which is composed of a DC-DC power converter and a three-phase inverter. The DC-DC power converter is a boost converter that converts the output voltage of the DC source into two voltage sources. The DC-DC converter has two switches and operates as in continuous conduction mode. The input current of DC-DC...

Grapheme-to-Phoneme Conversion for Amharic Text-to-Speech System

Developing a correct Grapheme-to-Phoneme (GTP) conversion method is a central problem in text-to-speech synthesis. In particular, deriving phonological features that are not shown in the orthography is challenging. In the Amharic language, geminates and epenthetic vowels are crucial for proper pronunciation, but neither is shown in the orthography. This paper describes an architecture, a preprocessing...

Grapheme-to-phoneme conversion for Chinese text-to-speech

This paper reports a study of grapheme-to-phoneme (G2P) conversion for a Chinese text-to-speech (TTS) system. As Chinese is a syllabic language, the syllable is commonly adopted as the phonetic unit in TTS and is represented by pinyin, the standard Chinese romanization. The task of Chinese G2P conversion is to find the correct pinyin for polyphonic graphemes in the input text. In this paper, a complete G2P fram...

On the Pronunciation of Common Lexica and Proper Names in European Portuguese

This paper presents some relevant aspects of the pronunciation of proper names and common lexica in European Portuguese. It starts with a brief description of statistical data concerning the occurrence and distribution of graphemes and phonemes for the two corpora and the distinction between different subclasses found in proper names, namely first and last names, toponyms and acronyms. The central t...

A Multi-Strategy Approach to Improving Pronunciation by Analogy

Pronunciation by analogy (PbA) is a data-driven method for relating letters to sound, with potential application to next-generation text-to-speech systems. This paper extends previous work on PbA in several directions. First, we have included "full" pattern matching between input letter string and dictionary entries, as well as including lexical stress in letter-to-phoneme conversion. Second, w...

Publication date: 2006